Promoter Region-Based Classification of Genes

نویسندگان

  • Paul Pavlidis
  • Terrence S. Furey
  • M. Liberto
  • David Haussler
  • William Noble Grundy
چکیده

In this paper we consider the problem of extracting information from the upstream untranslated regions of genes to make predictions about their transcriptional regulation. We present a method for classifying genes based on motif-based hidden Markov models (HMMs) of their promoter regions. Sequence motifs discovered in yeast promoters are used to construct HMMs that include parameters describing the number and relative locations of motifs within each sequence. Each model provides a Fisher kernel for a support vector machine, which can be used to predict the classifications of unannotated promoters. We demonstrate this method on two classes of genes from the budding yeast, S. cerevisiae. Our results suggest that the additional sequence features captured by the HMM assist in correctly classifying promoters.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of SFL1 and SFL2 Promoter Region in Arabidipsis thaliana using Gateway Cloning System

SFL1 and SFl2 (SETH Four Like) genes are two members of SETH4 gene family in Arabidopsis thaliana expressed in saprophytic tissues. In this study, expression of SFL1 and SFL2 genes were studied using Gateway Cloning Technology. Primers were designed for PCR amplification of promoter region of SFL1 (900 bp) and SFL2 (930 bp) genes having attB1 recombination sites using Kod Hi Fi DNA polymerase e...

متن کامل

Gene regulation network fitting of genes involved in the pathophysiology of fatty liver in the mice by promoter mining

Background and Aim: Non-Alcoholic Fatty Liver Disease (NAFLD) is the major cause of chronic liver disease in developed countries. In this study, we identified the most important transcription factors and biological mechanisms affecting the incidence of fatty liver disease using the promoter region data mining. Materials and Methods In this study, at first, the marker genes associated with this...

متن کامل

E-cadherin Promoter Methylation Comparison and Correlation with the Pathological Features of the Squamous Cell Carcinoma of Esophagus in the High Risk Region

E-cadherin is among tumor suppressor genes which mostly subjects to the down-regulation in squamous cell carcinoma of esophagus (SCCE). The gene is tightly associated with the tumor invasion and metastasis in multiple human cancers, especially SCCE. CpG islands’ methylation in the promoter region of E-cadherin is among the mechanisms that have been suggested for the E-cadherin silencing, howeve...

متن کامل

Independence of color intensity variation in red flesh apples from the number of repeat units in promoter region of the MdMYB10 gene as an allele to MdMYB1 and MdMYBA

MdMYB10 gene expression results in accumulation of anthocyanin in many tissues including flesh of applefruit. The MdMYB1 and MdMYBA genes are close homologues to MdMYB10 gene and both are responsiblefor red color phenotype in apple fruit skin. In the current study, an apple genome sequence draft analysisindicated that these three genes are located in a unique contig. Further a...

متن کامل

Comparison of Promoter Sequences of Flowering Control Genes, FT1 and Three Versions of VIN3, in Susceptible and Resistant Sugar Beet Genotypes to Bolting

Autumn sowing of sugar beet is a suitable way in sustainable agriculture. Bolting is an undesirable phenomenon which reduces sugar beet yield and it is the most important limiting factor in autumn sowing of sugar beet. Identification and comparison of the sequence of flowering genes in various genotypes can help to understand the molecular mechanisms controlling bolting. In the previous studies...

متن کامل

In silico screening of G-Quadruplex Structures in Wilms tumor 1 Gene Promoter

Introduction: X-ray diffraction studies have revealed that guanines in a DNA stands may be arranged in quartet and form a structure called G-quadruplexs. Bioinformatics studies suggested the formation of G-quadruplex structure in human crucial genes, including Wilms tumor 1 (WT1). The aim of this study was to in silico analysis of the guanine-rich sequence in the promoter region of the WT1 gene...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing

دوره   شماره 

صفحات  -

تاریخ انتشار 2001